A Randomized Online Quantile Summary in O(1
نویسندگان
چکیده
A quantile summary is a data structure that approximates to ε-relative error the order statistics of a much larger underlying dataset. In this paper we develop a randomized online quantile summary for the cash register data input model and comparison data domain model that uses O( ε log 1 ε ) words of memory. This improves upon the previous best upper bound of O( ε log 3/2 1 ε ) by Agarwal et al. [1]. Further, by a lower bound of Hung and Ting [4] no deterministic summary for the comparison model can outperform our randomized summary in terms of space complexity. Lastly, our summary has the nice property that O( ε log 1 ε ) words suffice to ensure that the success probability is 1−e −poly(1/ε). 1998 ACM Subject Classification F.2.2 Nonnumerical Algorithms and Problems, G.3 Probability and Statistics
منابع مشابه
A randomized online quantile summary in $O(\frac{1}{\varepsilon} \log \frac{1}{\varepsilon})$ words
A quantile summary is a data structure that approximates to ε-relative error the order statistics of a much larger underlying dataset. In this paper we develop a randomized online quantile summary for the cash register data input model and comparison data domain model that uses O( 1 ε log 1 ε ) words of memory. This improves upon the previous best upper bound of O( 1 ε log 1 ε ) by Agarwal et. ...
متن کاملA Randomized Online Quantile Summary in O(1/epsilon * log(1/epsilon)) Words
A quantile summary is a data structure that approximates to ε error the order statistics of a much larger underlying dataset. In this paper we develop a randomized online quantile summary for the cash register data input model and comparison data domain model that uses O((1/ε) log(1/ε)) words of memory. This improves upon the previous best upper bound of O((1/ε) log3/2(1/ε)) by Agarwal et al. (...
متن کاملA randomized online quantile summary in O(1/ɛ log 1/ɛ) words
A quantile summary is a data structure that approximates to ε-relative error the order statistics of a much larger underlying dataset. In this paper we develop a randomized online quantile summary for the cash register data input model and comparison data domain model that uses O( ε log 1 ε ) words of memory. This improves upon the previous best upper bound of O( ε log 3/2 1 ε ) by Agarwal et a...
متن کاملSpace Efficient Quantile Summary for Constrained Sliding Windows on a Data Stream
In many online applications, we need to maintain quantile statistics for a sliding window on a data stream. The sliding windows in natural form are defined as the most recent N data items. In this paper, we study the problem of estimating quantiles over other types of sliding windows. We present a uniform framework to process quantile queries for time constrained and filter based sliding window...
متن کاملPhysiologic mechanisms can predict hematologic responses to iron supplements in growing children: a computer simulation model.
BACKGROUND Iron deficiency is the most common preventable nutrition problem in developing countries. Several randomized clinical trials (RCTs) have been conducted to determine the effectiveness of various iron dosing schemes in multiple settings. OBJECTIVE The objective was to determine whether enough is known about iron metabolism to predict hemoglobin and serum ferritin (SF) concentrations ...
متن کامل